


T. molitor winter larvae 



Native Tm 12.86
Purification of 
Type III AFP 



mRNA Isolation 
encoding Tm 12.86 



Generation of 
Antiserum 

against 
Tm 12.86 



Activator 
Partial 
Isolation 



Generation of cDNA libraries 



Planned 
Further 
Isolation 



Screening cDNA libraries 



Isolating and characterizing CLONES 



Tm 13.17 



2-2 2-3 3-4 



3-9 



7-5 



Used for RI ASSAY 
Development 

Detailing 

*Assay specificity

*Assay sensitivity

*Numerical quantitation
-ice grain size
-dilution profiles
-regression analysis
-RI Factor estimates
-Slope comparisons

*Reference comparisons
to Tm 12.86 profiles

*Mathematical modeling

*Automation
-multiple samples
-image analyses

*Light Scattering method
for assessing RI

*Applications



Recombinant
Proteins



Inactive



Generating His-Tagged 
Signal Plus and 
Signal Minus Clones 



Recombinant
Proteins



Inactive



Redirection towards 
Recombinant Proteins 
Isolated from Bacterial 
Inclusion Bodies 



VIA 



Activity 



Quantified 



ANTIFREEZE ACTIVITY 
of Recombinant Peptides 



Capable of Further 



Enhancement 



Fig. 1.0



1.25 




Fig. 1.2



(min.) 




Fig. 1.5



NH2-L-T-D-E-Q-I-Q-K-R-N-K-I-S-K-E-?-Q-Q-V






1 2.5 5 10 15 20 



7.5 



H 




urn 08Z "(TO 




d^. 4.42 



4.0 




Wavelength (tim) 



3 4 






28S - 
18S - 







kD 



1 2 3 4 5 6 7 8 




30 — 
21 — 

14 — 






Fig. 2.2



1 2 3 4 5 6 7 8 



9 kD 




MCS 



lacP 





A J 


bio 




otlacZ 
CMV 
ColE1 
Neo' 


n 

















1. Construct DNA Library

2. Isolate positive clone 






lacP 



T3 



DNA insert 



T7 





A J 


bio 


IT 


CMV 
ColE1 
Neo' 


(l 

















3. Excise pBK-CMV phagemid 
containing cloned DNA insert 
by co-infection with helper phage 



bla' fi (-) origin (fl+T) 



SV40 polyA 



<§l DNA insert 







Mst 114388 
Ate/ 1 4303 
A/s/ 14231 



Ssp I 4410 
/ A/ae!134 



A/ael3369 



Nae t 3086 



MstU 2756 




Ssp I 445 
iW/i/1463 SV 40poly<A) 

SV40 3' splice

T7

5' splice
Kpn I 1019

Sac I 1126
T3

Nhe I 1300
Ns/l 1900 



13 



[Figure: pBK-CMV polylinker sequence, annotated with the BK reverse primer, T3 primer, T3 promoter, Sac I and BssH II sites, and T7 promoter]



1 2 3 4 5 6 kb 




Fig. 2.5



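DNA sequence of Tm 13.17 cDNA clone

1 AGTGGATCCAAAGAATTCGGCACGAGACTACTAAGATGAAGTTGCTCTGTTGTCTAATCT

MKLLCCLIS

61 CCCTCATTCTGTTGGTCACAGTTCAGGCCCTGACCGAGGCACAAATTGAGAAACTGAACA

LILLVTVQA ↓ LTEAQIEKLNK

121 AGATCAGCAAAAAATGTCAAAATGAAAGTGGAGTGTCGCAAGAGATCATAACCAAAGCTC

ISKKCQNESGVSQEIITKAR

181 GCAACGGTGACTGGGAGGACGATCCTAAACTGAAACGCCAAGTTTTTTGCGTGGCCAGGA

NGDWEDDPKLKRQVFCVARN

241 ACGCCGGTCTGGCCACGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA

AGLATESGEVVVDVLREKVR

301 GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATCATCAATAAGTGCGCCGTCAAGA

KVTDNDEETEKIINKCAVKR

361 GAGATACTGTTGAAGAGACGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA

DTVEETVFNTFKCVMKNKPK

421 AGTTCTCACCAGTTGATTGAACCACCACGACTAGTAGATGGTTCAAATGGTGTGCTTTAC

FSPVD*

481 ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAAAACTCG

polyadenylation signal poly (A) tail (26)
537 AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC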






A. Mature Tm 13.17 amino acid residues

1 LTEAQIEKLN KISKKCQNES GVSQEIITKA RNGDWEDDPK LKRQVFCVAR
51 NAGLATESGE VVVDVLREKV RKVTDNDEET EKIINKCAVK RDTVEETVFN
101 TFKCVMKNKP KFSPVD

B. Summary of the composition analysis for the mature Tm 13.17
sequence:

Residue     Number    Mole Percent
A = Ala        6         5.172
B = Asx        0         0.000
C = Cys        4         3.448
D = Asp        8         6.897
E = Glu       13        11.207
F = Phe        4         3.448
G = Gly        4         3.448
H = His        0         0.000
I = Ile        6         5.172
K = Lys       16        13.793
L = Leu        5         4.310
M = Met        1         0.862
N = Asn        8         6.897
P = Pro        3         2.586
Q = Gln        4         3.448
R = Arg        6         5.172
S = Ser        5         4.310
T = Thr        8         6.897
V = Val       14        12.069
W = Trp        1         0.862
Y = Tyr        0         0.000
Z = Glx        0         0.000

Molecular weight = 13171.96; Residues = 116; Average Residue Weight = 113.551
Charge = 1; Isoelectric point = 7.74.
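
The counts and mole percentages in panel B can be re-derived from the mature sequence in panel A. The following is a minimal Python sketch of that tally, not the analysis software used for the figure. It assumes the 116-residue sequence is read with Ile at position 6 and Val-Val-Val at positions 61 to 63, as in the translated cDNA; the name `composition` is illustrative only.

```python
from collections import Counter

# Mature Tm 13.17 sequence (116 residues) as listed in panel A.
# Assumption: position 6 is Ile and positions 61-63 are Val-Val-Val.
MATURE_TM_13_17 = (
    "LTEAQIEKLNKISKKCQNESGVSQEIITKARNGDWEDDPKLKRQVFCVAR"
    "NAGLATESGEVVVDVLREKVRKVTDNDEETEKIINKCAVKRDTVEETVFN"
    "TFKCVMKNKPKFSPVD"
)


def composition(seq):
    """Return {residue: (count, mole percent)} for a one-letter protein sequence."""
    counts = Counter(seq)
    total = len(seq)
    return {aa: (n, round(100.0 * n / total, 3)) for aa, n in sorted(counts.items())}


if __name__ == "__main__":
    # Print one row per residue type that occurs in the sequence.
    for aa, (n, pct) in composition(MATURE_TM_13_17).items():
        print(f"{aa}  {n:3d}  {pct:7.3f}")
```

Residues that do not occur (His, Tyr) simply do not appear in the returned dictionary, which corresponds to the zero rows of the table.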



Fig. 2.6c



[Figure: nucleotide sequence alignment of the Tm 13.17 and B1 cDNAs]


Fig. 2.7



[Figure: amino acid sequence alignment of Tm 13.17 and AFP-3]



Percent identity: 39.8 (identical amino acids); Percent similarity: 58.3 (identical plus
conservatively substituted amino acids).
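
Percent identity and percent similarity of this kind are computed column by column over an alignment. The sketch below shows one way to do that in Python. It rests on assumptions that the figure does not state: the conservative-substitution groups in `CONSERVATIVE_GROUPS` are hypothetical, the denominator is taken to be the number of gap-free columns, and the input strings in the usage note are toy sequences, not the Tm 13.17 / AFP-3 alignment.

```python
# Hypothetical conservative-substitution groups; the source does not say
# which grouping or scoring matrix was used for the 58.3 % figure.
CONSERVATIVE_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"),
    set("NQ"), set("ST"), set("AG"),
]


def is_conservative(x, y):
    """True if x and y fall in the same (assumed) conservative group."""
    return any(x in group and y in group for group in CONSERVATIVE_GROUPS)


def identity_and_similarity(aligned_a, aligned_b, gap="-"):
    """Return (percent identity, percent similarity) for two aligned sequences.

    Columns containing a gap in either sequence are skipped (an assumption).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b) if gap not in (x, y)]
    if not columns:
        raise ValueError("no gap-free columns to compare")
    identical = sum(x == y for x, y in columns)
    similar = sum(x == y or is_conservative(x, y) for x, y in columns)
    return 100.0 * identical / len(columns), 100.0 * similar / len(columns)
```

For example, `identity_and_similarity("KDTVEE", "KDTPED")` compares six columns, of which four are identical and one more (E/D) is conservative under the assumed groups.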
[Figure: amino acid sequence alignment of Tm 12.86, Tm 13.17, B1 and AFP-3]



Fig. 2.42



2-2 






1 GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCGTTCGCCGCC

MKLLLCFAFAA

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
IVIGAQA ↓ LTDEQIQK

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC
RNKISKECQQVSGVS

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGATGAT
QETIDKVRTGVLVDD

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA
PKMKKHVLCFSKKTG

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC
VATEAGDTNVEVLKA

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG
KLKHVASDEEVDKIV

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
QKCVVKKATPEETAY

361 GACACCTTCAAGTGTATTTACGACAGCAAACCTGATTTCTCTCCT
DTFKCIYDSKPDFSP

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTTGAC AATAAA GGT
ID*

polyadenylation signal

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA

poly (A) tail



Fig. 3.0



2-3 



1 GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCTTTCGCCGCC

MKLLLCFAFAA

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
IVIGAQA ↓ LTDEQIQK

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC
RNKISKECQQVSGVS

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT
QETIDKVRTGVLVDD

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA
PKMKKHVLCFSKKTG

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC
VATEAGDTNVEVLKA

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG
KLKHVASDEEVDKIV

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
QKCVVKKATPEETAY

361 GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT
DTFKCIYDSKPDFSP

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTTGAC AATAAA GGT
ID*

polyadenylation signal

451 ACTATCGTTATGAAAAAAAAAAAAAAAAAA

poly (A) tail



[Figure: nucleotide sequence alignment of clones 2-2 and 2-3, from the start codon through the poly (A) tail; boxed positions mark the nucleotides that differ between the two clones]



Predicted Amino Acid
Composition of 2-2 and 2-3

Analysis                        Whole Protein
Molecular Weight                12843.80 m.w.
Length                          115
1 microgram =                   77.859 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.22 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Whole Protein Composition Analysis

Amino Acid(s)           Number count   % by weight   % by frequency
Charged (RKHYCDE)            48           47.19          41.74
Acidic (DE)                  20           18.90          17.39
Basic (KR)                   20           20.40          17.39
Polar (NCQSTY)               30           25.35          26.09
Hydrophobic (AILFWV)         34           27.26          29.57
A Ala                         6            3.32           5.22
C Cys                         4            3.21           3.48
D Asp                        11            9.86           9.57
E Glu                         9            9.05           7.83
F Phe                         3            3.44           2.61
G Gly                         4            1.78           3.48
H His                         2            2.14           1.74
I Ile                         6            5.29           5.22
K Lys                        18           17.97          15.65
L Leu                         5            4.41           4.35
M Met                         1            1.02           0.87
N Asn                         2            1.78           1.74
P Pro                         4            3.02           3.48
Q Gln                         6            5.98           5.22
R Arg                         2            2.43           1.74
S Ser                         7            4.75           6.09
T Thr                         9            7.08           7.83
V Val                        14           10.80          12.17
W Trp                         0            0.00           0.00
Y Tyr                         2            2.54           1.74
B Asx                         0            0.00           0.00
Z Glx                         0            0.00           0.00
X Xxx                         0            0.00           0.00
. Ter                         0            0.00           0.00
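
Several of the whole-protein values in this table are simple functions of the molecular weight and of the Trp, Tyr and Cys counts. The Python sketch below reproduces them under stated assumptions. The per-residue coefficients (5690 for Trp, 1280 for Tyr, 120 for Cys) are a commonly used approximation that happens to match the tabulated 3040; the source does not say which method the analysis software applied, and the function names are illustrative.

```python
def pmol_per_microgram(molecular_weight):
    """Picomoles contained in 1 microgram of protein (MW in g/mol)."""
    # 1 ug = 1e-6 g; moles = 1e-6 / MW; picomoles = moles * 1e12.
    return 1e6 / molecular_weight


def extinction_coefficient_280(n_trp, n_tyr, n_cys):
    """Approximate molar extinction coefficient at 280 nm (M^-1 cm^-1).

    Assumed residue contributions: Trp 5690, Tyr 1280, Cys 120.
    """
    return 5690 * n_trp + 1280 * n_tyr + 120 * n_cys


def mg_per_ml_at_unit_a280(molecular_weight, extinction):
    """Protein concentration (mg/ml) that gives A(280) = 1 in a 1 cm path."""
    return molecular_weight / extinction


if __name__ == "__main__":
    mw = 12843.80                                   # clones 2-2 and 2-3
    eps = extinction_coefficient_280(0, 2, 4)       # 0 Trp, 2 Tyr, 4 Cys
    print(round(pmol_per_microgram(mw), 3))         # pMoles per microgram
    print(eps)                                      # molar extinction coefficient
    print(round(mg_per_ml_at_unit_a280(mw, eps), 2))  # mg/ml at A(280) = 1
```

The same functions applied to a protein with three cysteines and a molecular weight of 12839.70 give 2920 and 4.40 mg/ml.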



Fig. 3.3



A B C D E F MW 







41.8 
30.6 

17.8 
6.9 



Fig. 3.4



Lane 



1 



2 3 4 5 




577 bp 
483 bp 



Fig. 4.0




Fig. 4.2



Lane 1 2 3 4 5 6 



m 



55 



400 0 bp — 



Lane 



2313Q 



Tm 13.17 cDNA

1 AGTGGATCCAAAGAATTCGGCACGAGACTACTAAGATGAAGTTGCTCTGTTGTCTAATCT

MKLLCCLIS

61 CCCTCATTCTGTTGGTCACAGTTCAGGCCCTGACCGAGGCACAAATTGAGAAACTGAACA

LILLVTVQA ↓ LTEAQIEKLNK

Forward Primer

121 AGATCAGCAAAAAATGTCAAAATGAAAGTGGAGTGTCGCAAGAGATCATAACCAAAGCTC

ISKKCQNESGVSQEIITKAR

181 GCAACGGTGACTGGGAGGACGATCCTAAACTGAAACGCCAAGTTTTTTGCGTGGCCAGGA

NGDWEDDPKLKRQVFCVARN

241 ACGCCGGTCTGGCCACGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA

AGLATESGEVVVDVLREKVR

301 GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATCATCAATAAGTGCGCCGTCAAGA

KVTDNDEETEKIINKCAVKR

Reverse Primer

361 GAGATACTGTTGAAGAGACGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA

DTVEETVFNTFKCVMKNKPK

421 AGTTCTCACCAGTTGATTGAACCACCACGACTAGTAGATGGTTCAAATGGTGTGCTTTAC
FSPVD*

481 ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAAAACTCG

polyadenylation signal poly (A) tail (26)
537 AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC



Forward Primer 
Primer      Percent (%) composition          Melting Temperature (°C)
            A      C      G      T

Forward     28.6   14.3   42.9   14.3        44.0
Reverse     25.0   31.3    6.3   37.5        44.0
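
Both tabulated melting temperatures are consistent with the simple Wallace rule, Tm = 2(A+T) + 4(G+C), applied to a 14-mer and a 16-mer with the listed base percentages; the figure does not state which formula was used, so that is an inference. The Python sketch below computes base percentages and a Wallace-rule Tm. The sequence `EXAMPLE_PRIMER` is hypothetical: it was made up to have the forward primer's base composition and is not the primer used in the source.

```python
def base_percentages(primer):
    """Percent A, C, G and T in a primer, rounded to one decimal place."""
    primer = primer.upper()
    return {base: round(100.0 * primer.count(base) / len(primer), 1) for base in "ACGT"}


def wallace_tm(primer):
    """Melting temperature (deg C) by the Wallace rule: 2(A+T) + 4(G+C)."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2.0 * at + 4.0 * gc


# Hypothetical 14-mer with 4 A, 2 C, 6 G and 2 T; for illustration only.
EXAMPLE_PRIMER = "GAGGCAGTGATACG"

if __name__ == "__main__":
    print(base_percentages(EXAMPLE_PRIMER))
    print(wallace_tm(EXAMPLE_PRIMER))
```

The Wallace rule is only a rough guide for short oligonucleotides; nearest-neighbor methods give different values for the same sequence.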




Fig. 4.7




Fig. 4.8




8000 bp
4000 bp 



3-4 



1 GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCTTTCGCCGCC

MKLLLCFAFAA

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
IVIGAQA ↓ LTDEQIQK

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC
RNKISKECQQVSGVS

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT
QETIDKVRTGVLVDD

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA
PKMKKHVLCFSKKTG

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC
VATEAGDTNVEVLKA

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG
KLKHVASDEEVDKIV

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
QKCVVKKATPEETAY

361 GACACCTTCAAGGTTATTTACGACAGTAAACCTGATTTCTCTCCT
DTFKVIYDSKPDFSP

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTTGAC AATAAA GGT
ID*

polyadenylation signal

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA

poly (A) tail



Predicted Amino Acid
Composition of 3-4

Molecular Weight                12839.70 m.w.
Length                          115
1 microgram =                   77.883 pMoles
Molar Extinction coefficient    2920±5%
1 A(280) =                      4.40 mg/ml
Isoelectric Point               7.14
Charge at pH 7                  0.16

Whole Protein Composition Analysis

Amino Acid(s)           Number count   % by weight   % by frequency
Charged (RKHYCDE)            47           46.41          40.87
Acidic (DE)                  20           18.91          17.39
Basic (KR)                   20           20.41          17.39
Polar (NCQSTY)               29           24.55          25.22
Hydrophobic (AILFWV)         35           28.04          30.43
A Ala                         6            3.32           5.22
C Cys                         3            2.41           2.61
D Asp                        11            9.86           9.57
E Glu                         9            9.05           7.83
F Phe                         3            3.44           2.61
G Gly                         4            1.78           3.48
H His                         2            2.14           1.74
I Ile                         6            5.29           5.22
K Lys                        18           17.97          15.65
L Leu                         5            4.41           4.35
M Met                         1            1.02           0.87
N Asn                         2            1.78           1.74
P Pro                         4            3.02           3.48
Q Gln                         6            5.99           5.22
R Arg                         2            2.43           1.74
S Ser                         7            4.75           6.09
T Thr                         9            7.09           7.83
V Val                        15           11.58          13.04
W Trp                         0            0.00           0.00
Y Tyr                         2            2.54           1.74
B Asx                         0            0.00           0.00
Z Glx                         0            0.00           0.00
X Xxx                         0            0.00           0.00
. Ter                         0            0.00           0.00



3-9 



1 GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCTTTCGCCGCC

MKLLLCFAFAA

47 ATCGTCATCGGAGCTCAGGCTCTCACCGATGAACAGATACAGAAA
IVIGAQA ↓ LTDEQIQK

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGAGTCCGGAGTGTCC
RNKISKECQQESGVS

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT
QETIDKVRTGVLVDD

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAGAACTGGA
PKMKKHVLCFSKRTG

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC
VATEAGDTNVEVLKA

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG
KLKHVASDEEVDKIV

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
QKCVVKKATPEETAY

361 GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT
DTFKVIYDSKPDFSP

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTTGAC AATAAA GGT
ID*

polyadenylation signal

451 ACTATCGTTATGAAAAAAAAAAAAAAAAAA

poly (A) tail



Predicted Amino Acid
Composition of 3-9

Analysis                        Whole Protein
Molecular Weight                12871.80 m.w.
Length                          115
1 microgram =                   77.689 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.23 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Whole Protein Composition Analysis

Amino Acid(s)           Number count   % by weight   % by frequency
Charged (RKHYCDE)            48           47.31          41.74
Acidic (DE)                  20           18.86          17.39
Basic (KR)                   20           20.57          17.39
Polar (NCQSTY)               30           25.29          26.09
Hydrophobic (AILFWV)         34           27.20          29.57
A Ala                         6            3.31           5.22
C Cys                         4            3.21           3.48
D Asp                        11            9.84           9.57
E Glu                         9            9.03           7.83
F Phe                         3            3.43           2.61
G Gly                         4            1.77           3.48
H His                         2            2.13           1.74
I Ile                         6            5.28           5.22
K Lys                        17           16.93          14.78
L Leu                         5            4.40           4.35
M Met                         1            1.02           0.87
N Asn                         2            1.77           1.74
P Pro                         4            3.02           3.48
Q Gln                         6            5.97           5.22
R Arg                         3            3.64           2.61
S Ser                         7            4.74           6.09
T Thr                         9            7.07           7.83
V Val                        14           10.78          12.17
W Trp                         0            0.00           0.00
Y Tyr                         2            2.54           1.74
B Asx                         0            0.00           0.00
Z Glx                         0            0.00           0.00
X Xxx                         0            0.00           0.00
. Ter                         0            0.00           0.00



7-5 



1 GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCGTTCGCCGCC

MKLLLCFAFAA

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
IVIGAQA ↓ LTDEQIQK

92 AGGAACAAGATCAGCAAAGAGTGCCAGCAGGTGTCCGGAGTGTCC
RNKISKECQQESGVS

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT
QETIDKVRTGVLVDD

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA
PKMKKHVLCFSKRTG

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC
VATEAGDTNVEVLKA

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG
KLKHVASDEEVDKIV

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
QKCVVKKATPEETAY

361 GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT
DTFKVIYDSKPDFSP

406 ATTGATTAATTGTTTTGTATTTGGCTGAATTTTGAC AATAAA GGT
ID*

polyadenylation signal

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA

poly (A) tail



Predicted Amino Acid
Composition of 7-5

Analysis                        Whole Protein
Molecular Weight                12843.80 m.w.
Length                          115
1 microgram =                   77.859 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.22 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Whole Protein Composition Analysis

Amino Acid(s)           Number count   % by weight   % by frequency
Charged (RKHYCDE)            48           47.19          41.74
Acidic (DE)                  20           18.90          17.39
Basic (KR)                   20           20.40          17.39
Polar (NCQSTY)               30           25.35          26.09
Hydrophobic (AILFWV)         34           27.26          29.57
A Ala                         6            3.32           5.22
C Cys                         4            3.21           3.48
D Asp                        11            9.86           9.57
E Glu                         9            9.05           7.83
F Phe                         3            3.44           2.61
G Gly                         4            1.78           3.48
H His                         2            2.14           1.74
I Ile                         6            5.29           5.22
K Lys                        18           17.97          15.65
L Leu                         5            4.41           4.35
M Met                         1            1.02           0.87
N Asn                         2            1.78           1.74
P Pro                         4            3.02           3.48
Q Gln                         6            5.98           5.22
R Arg                         2            2.43           1.74
S Ser                         7            4.75           6.09
T Thr                         9            7.08           7.83
V Val                        14           10.80          12.17
W Trp                         0            0.00           0.00
Y Tyr                         2            2.54           1.74
B Asx                         0            0.00           0.00
Z Glx                         0            0.00           0.00
X Xxx                         0            0.00           0.00
. Ter                         0            0.00           0.00
His-tagged clone 2.2 with signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50

His-tag Start Codon

AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96

Met Gly Ser Ser His His His His His His Ser
-55 -50

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly
-45 -40 -35

AFP Start Codon

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186
Gly Gln Gln Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met
-30 -25 -20

AAA CTC CTC TTG TGC TTT GCG TTC GCC GCC ATC GTC ATC GGA GCT 231
Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala Ile Val Ile Gly Ala
-15 -10 -5

N-terminal of mature AFP
CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 276
Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser

1 5 10

AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321
Lys Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp
15 20 25

AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 366
Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys
30 35 40

CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411
His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala
45 50 55

GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456
Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val
60 65 70

GCC AGC GAC GAA GAG GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501
Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys Val Val
75 80 85

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546

Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys
90 95 100

Stop Codon

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 595

Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp *
105 110 115

Polyadenylation signal Poly-A tail
TTTGACTGAA TTTTGACAAT AAAGGTAATA TCGTTATGTA AAAAAAAAAA 645

AAAAAACTCG AGCACCACCA CCACCACCAC TGAGAT 681



5,7 
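
The three-letter translations printed under each codon row can be checked mechanically against the nucleotide sequence. The Python sketch below translates an in-frame DNA string with the standard genetic code; it is an illustrative helper, not a tool referenced by the source, and the names `CODON_TABLE` and `translate` are assumptions.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, to_stop=True):
    """Translate an in-frame DNA string into one-letter amino acids.

    Whitespace is ignored. With to_stop=True translation ends at the first
    stop codon; otherwise stop codons are written as '*'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Start of the His-tag reading frame and the mature AFP N-terminus.
    print(translate("ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC"))
    print(translate("CTC ACC GAC GAA CAG ATA CAG AAA"))
```

Applied to the first coding row of the construct, the helper returns Met-Gly-Ser-Ser followed by six histidines in one-letter form, matching the printed translation.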



His-tagged clone 2.2 without signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50

His-tag Start Codon

AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96

Met Gly Ser Ser His His His His His His Ser
-30 -25

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly
-20 -15 -10

N-terminal of mature AFP
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 186
Gly Gln Gln Met Gly Arg Gly Ser Leu Thr Asp Glu Gln Ile Gln
-5 1 5

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 231
Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val

10 15 20

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 276
Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp

25 30 35

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 321

Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr

40 45 50

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 366

Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys

55 60 65

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAG GTG GAC AAG ATC 411

Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile
70 75 80

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 456
Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala
85 90 95

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 501
Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser
100 105 110

Stop Codon

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 543
Pro Ile Asp *
115
His-tagged clone 2.3 with signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50

His-tag Start Codon
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96
Met Gly Ser Ser His His His His His His Ser
-55 -50

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly
-45 -40 -35

AFP Start Codon

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186
Gly Gln Gln Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met
-30 -25 -20

AAA CTC CTC TTG TGC TTT GCT TTC GCC GCC ATC GTC ATC GGA GCT 231
Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala Ile Val Ile Gly Ala
-15 -10 -5

N-terminal of mature AFP
CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 276
Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser

1 5 10

AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321
Lys Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp
15 20 25

AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 366
Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys
30 35 40

CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411
His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala
45 50 55

GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456
Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val
60 65 70

GCC AGC GAC GAA GAA GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501
Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys Val Val
75 80 85

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546
Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys
90 95 100

Stop Codon

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 595
Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp *
105 110 115

Polyadenylation signal Poly-A tail
TTTGACTGAA TTTTGAC AAT AAA GGTACTA TCGTTATGAA AAAAAAAAAA 645

AAAAAAACTC GAGCACCACC ACCACCACCA CTGAGAT 682



His-tagged clone 2.3 without signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50

His-tag Start Codon
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96
Met Gly Ser Ser His His His His His His Ser
-30 -25

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly
-20 -15 -10

N-terminal of mature AFP
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 186
Gly Gln Gln Met Gly Arg Gly Ser Leu Thr Asp Glu Gln Ile Gln
-5 1 5

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 231
Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val
10 15 20

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 276

Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp

25 30 35

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 321

Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr

40 45 50

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 366

Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys

55 60 65

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAA GTG GAC AAG ATC 411

Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile

70 75 80

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 456

Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala

85 90 95

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 501

Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser

100 105 110

Stop Codon

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 543
Pro Ile Asp *
115



Fig. 5.40



His-tagged Tm 13.17 with signal sequence

TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50

His-tag Start Codon
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96
Met Gly Ser Ser His His His His His His Ser
-65 -60 -55

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly
-50 -45 -40

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC TGG ATC CAA AGA ATT 186
Gly Gln Gln Met Gly Arg Gly Ser Glu Phe Trp Ile Gln Arg Ile
-35 -30 -25

AFP Start Codon

CGG CAC GAG ACT ACT AAG ATG AAG TTG CTC TGT TGT CTA ATC TCC 231

Arg His Glu Thr Thr Lys Met Lys Leu Leu Cys Cys Leu Ile Ser

-20 -15 -10

N-terminal of mature AFP
CTC ATT CTG TTG GTC ACA GTT CAG GCC CTG ACC GAG GCA CAA ATT 276
Leu Ile Leu Leu Val Thr Val Gln Ala Leu Thr Glu Ala Gln Ile

-5 1 5

GAG AAA CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA 321

Glu Lys Leu Asn Lys Ile Ser Lys Lys Cys Gln Asn Glu Ser Gly

10 15 20

GTG TCG CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG 366

Val Ser Gln Glu Ile Ile Thr Lys Ala Arg Asn Gly Asp Trp Glu

25 30 35

GAC GAT CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC 411
Asp Asp Pro Lys Leu Lys Arg Gln Val Phe Cys Val Ala Arg Asn

40 45 50

GCC GGT CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG 456
Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu
55 60 65

AGG GAG AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG 501
Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu
70 75 80

AAA ATC ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG 546
Lys Ile Ile Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu
85 90 95

ACG GTG TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG 595
Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys
100 105 110

Stop Codon

TTC TCA CCA GTT GAT TGA ACCACCACGA CTAGTAGATG GTTCAAATGG 643
Phe Ser Pro Val Asp *
115

Polyadenylation signal Poly-A tail
TGTGCTTTAC ATATAA AAAT AAA GTGTTTC TGATGTAAAA AAAAAAAAAA 693

AAAAAAAAAA AACTCGAGAG TATTCTAGAG CGGCCGCGGG CCCATCGTTT 743

TCCACCCCTC GAGCACCACC ACCACCACCA CTGAGAT 777



His-tagged Tm 13.17 without signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 

His-tag Start Codon
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC
Met Gly Ser Ser His His His His His His Ser
-30 -25

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly
-20 -15 -10

N-terminal of mature AFP
GGA CAG CAA ATG GGT CGC GGC CTG ACC GAG GCA CAA ATT GAG AAA
Gly Gln Gln Met Gly Arg Gly Leu Thr Glu Ala Gln Ile Glu Lys
-5 1 5

CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA GTG TCG
Leu Asn Lys Ile Ser Lys Lys Cys Gln Asn Glu Ser Gly Val Ser
10 15 20

CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG GAC GAT
Gln Glu Ile Ile Thr Lys Ala Arg Asn Gly Asp Trp Glu Asp Asp
25 30 35

CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC GCC GGT
Pro Lys Leu Lys Arg Gln Val Phe Cys Val Ala Arg Asn Ala Gly
40 45 50

CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG AGG GAG
Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu Arg Glu
55 60 65

AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG AAA ATC
Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu Lys Ile
70 75 80

ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG ACG GTG
Ile Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu Thr Val
85 90 95

TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG TTC TCA 
Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys Phe Ser 
100 105 110 

Stop Codon 

CCA GTT GAT TGA CTCGAGCACC ACCACCACCA CCACTGAGAT 
Pro Val Asp * 
115 
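
The position of the His-tag within the reading frame can be located by scanning for a run of histidine codons (CAT or CAC). This is a small sketch under that assumption; `find_his_run` and its `min_codons` parameter are hypothetical names introduced here for illustration, and the test sequence is the first coding line of the figure.

```python
def find_his_run(cds: str, min_codons: int = 6):
    """Return (first_codon_index, run_length) of the first run of His codons.

    Codon indices are 0-based and counted from the start of `cds`,
    which is assumed to begin in frame. Returns None if no run of at
    least `min_codons` consecutive CAT/CAC codons is found.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    start = None
    # A sentinel codon closes a run that extends to the end of the sequence.
    for idx, codon in enumerate(codons + ["END"]):
        if codon in ("CAT", "CAC"):
            if start is None:
                start = idx
        else:
            if start is not None and idx - start >= min_codons:
                return start, idx - start
            start = None
    return None


# First coding line of the His-tagged construct, beginning at the start codon.
tag_region = "ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC"
print(find_his_run(tag_region))
```

For the line shown, the run begins at the fifth codon (index 4) and spans six codons, which agrees with the Met-Gly-Ser-Ser-His6 layout in the figure.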



FIG. 5.42
[gel image: MW (kD) scale; lanes 1 and 2]

FIG. 6.0

Tm 13.17 S-graph data

FIG. 6.3



Letter  Name             Abbrev.  Chemical class             Hydropathy           Reactivity
A       Alanine          Ala      aliphatic                  mod. hydrophobic     low
B       Asp or Asn       Asx
C       Cysteine         Cys      sulfhydryl                 hydrophobic          high
D       Aspartic Acid    Asp      acidic                     highly hydrophilic   high
E       Glutamic Acid    Glu      (illegible)                highly hydrophilic   high
F       Phenylalanine    Phe      aromatic                   highly hydrophobic   (illegible)
G       Glycine          Gly      aliphatic                  mod. hydrophobic     low
H       Histidine        His      basic, imidazole           highly hydrophilic   high
I       Isoleucine       Ile      aliphatic                  hydrophobic          low
K       Lysine           Lys      basic                      highly hydrophilic   high
L       Leucine          Leu      (illegible)                hydrophobic          low
M       Methionine       Met      sulfhydryl                 hydrophobic          low
N       Asparagine       Asn      amide, acidic derived      hydrophilic          high
P       Proline          Pro      aliphatic, cyclic, imino   mod. hydrophilic     low
Q       Glutamine        Gln      amide, acidic derived      hydrophilic          high
R       Arginine         Arg      basic                      highly hydrophilic   high
S       Serine           Ser      aliphatic hydroxyl         hydrophilic          high
T       Threonine        Thr      aliphatic hydroxyl         hydrophilic          high
V       Valine           Val      aliphatic                  hydrophobic          low
W       Tryptophan       Trp      aromatic                   highly hydrophobic   low
Y       Tyrosine         Tyr      aromatic                   mod. hydrophilic     high
Z       Glu or Gln       Glx

ACD     Any Acidic
ALP     Any Aliphatic
ALH     Any Aliphatic Hydroxyl
ARO     Any Aromatic
BAS     Any Basic
HY-     Hydrophobic
HY+     Hydrophilic

Columns: Position | Tm12.84-2.2 | Tm12.84-2.3 | Tm12.84-3.4 | Tm12.84-3.9 | Tm12.84-7.5 | Consensus of Tm12.84 | Tm13.17 | Consensus with Tm13.17 | B1 | Consensus with B1 | AFP-3 | Consensus with AFP-3
(. = no entry in the source)

1 A A A A A A A A . A C N
2 C C C C C C G N . N A N
3 G G G G G G A R . R G R
4 A A A A A A C N . N A N
5 G G G G G G T N . N T N
6 C C C C C C A N . N C N
7 A A A A A A C N . N C N
8 A A A A A A T N . N G N
9 A A A A A A A A . A A A
10 A A A A A A A A . A A A
11 A A A A A A G R . R G R
12 . . . . . . * * . . . .
13 A A A A A A A A A? A A A
14 T T T T T T T T T? T T T
15 G G G G G G G G G? G G G
16 A A A A A A A A . A A A
17 A A A A A A A A . A A A
18 A A A A A A G R . R G R
19 C C C C C C T Y . Y C Y
20 T T T T T T T T . T T T
21 C C C C C C G C/G . C/G C C/G
22 C C C C C C C C . C C C
23 T T T T T T T T . T T T
24 C C C C C C C C . C C C
25 T T T T T T T T . T C Y
26 T T T T T . G N . N . .
27 G G G G G G T N . N C N
28 T T T T T T T T . T T T
29 G G G G G G G G . G G G
30 C C C C C C T Y . Y T Y
31 T T T T T T C Y C Y C Y
32 T T T T T T T T T T T T
33 T T T T T T A T/A T T/A T T/A
34 G G G G G G A R A R G R
35 C C C C C C T Y C Y T Y
36 G T T T G N C N T N C N
37 T T T T T T T T T T C Y
38 T T T T T T C Y C Y T Y
39 C C C C C C C C T Y C Y
40 G G G G G G C G/C C G/C G G/C
41 C C C C C C T Y T Y T Y
42 C C C C C C C C C C T Y
43 G G G G G G A R A R G R
44 C C C C C C T Y T Y C Y
45 C C C C C C T Y T Y C Y
46 A A A A A A C N C N T N
47 T T T T T T T T T T T T
48 C C C C C C G C/G G C/G G C/G
49 G G G G G G T N T N G N
50 T T T T T T T T T T T T
51 C C C C C C G C/G G C/G C C/G
52 A A A A A A G R G R G R
53 T T T T T T T T T T C Y
54 C C C C C C C C C C C C
55 G G G G G G A R C N G N
56 G G G G G G C G/C C G/C C G/C
57 A A A A A A A A A A G R
58 G G G G G G G G G G A R
59 C C C C C C T Y T Y C Y
60 T T T T T T T T T T C Y
61 C C C C C C C C C C T Y
62 A A A A A A A A A A A A
63 G G G G G G G G G G C N
64 G G G G G G G G G G G G
65 C C C C C C C C C C C C
66 T T T T T T C Y C Y C Y
67 C C C C C C C C A N . N
68 T T T T T T T T T T . T
69 C C C C C C G C/G A N . N
70 A A A A A A A A A A . A
71 C C C C C C C C C C . C
72 C C C C C C C C T Y . Y
73 G G G G G G G G C G/C G G/C
74 A A A A A A A A A A A A
75 C C C T C Y G N G N A N
76 G G G G G G G G G G A R
77 A A A A A A C N A N C N
78 A A A A A A A A A A A A
79 C C C C C C C C G C/G C C/G
80 A A A A A A A A A A C N
81 G G G G G G A R C N T N
82 A A A A A A A A C N C N
83 T T T T T T T T T T G N
84 A A A A A A T A/T T A/T T A/T
85 C C C C C . G C/G G C/G G C/G
86 A A A A A A A A A A A A
87 G G G G G G G G G G G G
88 A A A A A A A A . N A N
89 A A A A A A A A T A/T A A/T
90 A A A A A A A A A A G R
91 A A A A A A C N C N C N
92 G G G G G G T N T N . N
93 G G G G G G G G G G G G
94 A A A A A A A A C N A N
95 A A A A A A A A C N A N
96 C C C C C C C C C . G Y
97 . . . . . . . . . N . .
98 A A A A A A A A A A A A
99 G G G G G G G G A R G R
100 A A A A A A A A A A C N
101 T T T T T T T T C Y A N
102 C C C C C C C C C C C C
103 A A A A A A A A A A A A
104 G G G G G G G G G G G G
105 C C C C C C C C C C C C
106 A A A A A A A A G R G R
107 A A A A A A A A C N A N
108 A A A A A A A A A A C N
109 G G G G G G A R G R G R
110 A A A A A A A A A A C N
111 A A A A G R A R G R C N
112 T T T T T T T T T T T T
113 G G G G G G G G G G G G
114 C C C C C C T Y C Y C Y
115 C C C C C C C C A N A N
116 A A A A A A A A A A A A
117 G G G G G G A R G R G R

FIG. 7.2



Columns: Position | Tm12.84-2.2 | Tm12.84-2.3 | Tm12.84-3.4 | Tm12.84-3.9 | Tm12.84-7.5 | Consensus of Tm12.84 | Tm13.17 | Consensus with Tm13.17 | B1 | Consensus with B1 | AFP-3 | Consensus with AFP-3
(. = no entry in the source; ? = illegible)

118 C C C C C C A ? . N . .
119 A A A A A A A . . . ? N
120 G G G G G G . . ? . . ?
121 G G G G G G G G . G G .
122 T T T A T T/A A T/A A T/A A A/T
123 G G G G G . A R A R G ?
124 T T T T T T A T/A A T/A T T/A
125 C C C C C C G . G . . C/G
126 C C C C C C T Y T Y T Y
127 G G G G G G G G . . . G
128 G G G G G G G G G G G .
129 A A A A A A A . . . . A
130 G G G G G G G G . . G G
131 T T T T T T T . T . . T
132 G G G G G G G G G G A R
133 T T T T T T T T T T T T
134 C C C C C C C C . C C .
135 C C C C C C G C/G C C/G T N
136 C C C C C C C C G C/G G C/G
137 A A A A A A A A A A A A
138 A A A A A A A A A A A A
139 G G G G G G G G G G G G
140 A A A A A A A A A A A A
141 G G G G G G G G C G/C G G/C
142 A A A A A A A A G R T N
143 C C C C C C T Y T Y C Y
144 G G G G G G C G/C C G/C C G/C
145 A A A A A A A A A A C N
146 T T T T T T T T T T T T
147 C C C C C C A N A N C N
148 G G G G G G A R A R A R
149 A A A A A A C N A N A N
150 C C C C C C C C G C/G C C/G
151 A A A A A A A A A A A A
152 A A A A A A A A G R A R
153 A A A A A A A A A A G R
154 G G G G G G G G G G G G
155 T T T T T T C Y C Y T Y
156 C C C C C C T Y T . T .
157 C C C C C C C C C C C C
158 G G G G G G G G G G G G
159 C C C C C C C C C C C C
160 A A A A A A A A A A A A
161 C C C C C C A N A N A N
162 A A A A A A C N A N C N
163 G G G G G G G G G G C G/C
164 G G G G G G G G G G G G
165 T T T T T T T T T T T T
166 G G G G G G G G G G G G
167 T T T T T T A T/A A T/A A T/A
168 C C C C C C C C C C A N
169 T T T T T T T T T T G N
170 T T T T T T G N T N A N
171 G G G G G G G G G G A R
172 G G G G G G G G G G G G
173 T T T T T T A T/A A T/A T T/A
174 C C C C C C G C/G G C/G G C/G
175 G G G G G G G G G G G G
176 A A A A A A A A A A A A
177 T C C C C Y C Y C Y C Y
178 G G G G G G G G G G G G
179 A A A A A A A A A A A A
180 T T T T T T T T C Y C Y
181 C C C C C C C C C C C C
182 C C C C C C C C C C C C
183 C C C C C C T Y C Y C Y
184 A A A A A A A A A A A A
185 A A A A A A A A A A A A
186 A A A A A A A A A A A A
187 A A A A A A C N C N C N
188 T T T T T T T T T T T T
189 G G G G G G G G G G G G
190 A A A A A A A A A A A A
191 A A A A A A A A A A A A
192 G G G G G G A R A R A R
193 A A A A A A C N A N G N
194 A A A A A A G R T N A N
195 G G G G G G C G/C G G/C G G/C
196 C C C C C C C C C C C C
197 A A A A A A A A A A A A
198 C C C C C C A N A N T N
199 G G G G G G G G C G/C G G/C
200 T T T T T T T T T T C Y
201 C C C C C C T Y T Y T Y
202 C C C C C C T Y C Y T Y
203 T T T T T T T T T T T T
204 C C C C C C T Y T Y C Y
205 T T T T T T T T T T T T
206 G G G G G G G G G G G G
207 C C C C C C C C C C C C
208 T T T T T T G N A N A N
209 T T T T T T T T T T T T
210 C C C C C C G C/G T N C N
211 T T T T T T G N T N T N
212 C C C C C C C C T Y T Y
213 G G G G G G C G/C C G/C G G/C
214 A A A A A A A A A A A A
215 A A A A A A G R A R A R
216 G G G G G G G G G G G G
217 A A A A A A A A G R A R
218 A A A G A R A R C N G N
219 A A A A A A C N A N A N
220 A A A A A A G R C N G N
221 C C C C C C C C T Y C Y
222 T T T T T T C Y C Y C Y
223 G G G G G G G G G G G G
224 G G G G G G G G A R G R
225 A A A A A A T A/T A A/T A A/T
226 G G G G G G C G/C A N T N
227 T T T T T T T T T T T T
228 G G G G G G G G A R C N
229 G G G G G G G G G G A R
230 C C C C C C C C T Y T Y
231 A A A A A A C N C N C N
232 A A A A A A A A G R G R
233 C C C C C C C C C C A N
234 C C C C C C G C/G G C/G C C/G
235 G G G G G G G G G G G G


FIG. 7.2 (cont.)
[Alignment positions 236 to 421: only scattered position numbers and the column headings (Consensus of ..., Consensus with ...) survive; the nucleotide columns are not recoverable.]
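
The consensus columns of FIG. 7.2 appear to merge bases position by position: identical bases are kept, A with G is written R, C with T is written Y, C with G and A with T are written with a slash, and most other combinations are written N. The sketch below encodes that reading as an assumption; `consensus_code` and the `EXPAND` table are hypothetical helpers, the slash order (C/G versus G/C) is not preserved, and the figure itself is not fully consistent (a few C-with-G positions are printed as N).

```python
# Each symbol used in the figure, expanded to the set of bases it stands for.
EXPAND = {
    "A": {"A"}, "C": {"C"}, "G": {"G"}, "T": {"T"},
    "R": {"A", "G"}, "Y": {"C", "T"},
    "C/G": {"C", "G"}, "G/C": {"C", "G"},
    "A/T": {"A", "T"}, "T/A": {"A", "T"},
    "N": {"A", "C", "G", "T"},
}


def consensus_code(*symbols: str) -> str:
    """Merge one alignment column (bases or earlier consensus symbols) into one symbol."""
    bases = set()
    for symbol in symbols:
        bases |= EXPAND[symbol.upper()]
    if len(bases) == 1:
        return next(iter(bases))
    if bases == {"A", "G"}:
        return "R"   # purines
    if bases == {"C", "T"}:
        return "Y"   # pyrimidines
    if bases == {"C", "G"}:
        return "C/G"
    if bases == {"A", "T"}:
        return "A/T"
    return "N"       # any other mixture


# Position 3 of the alignment: Tm12.84 consensus G, Tm13.17 A, AFP-3 G.
step1 = consensus_code("G", "A")      # consensus with Tm13.17
step2 = consensus_code(step1, "G")    # carried forward and merged with AFP-3
print(step1, step2)
```

Feeding a running consensus symbol back in, as in the second call, mirrors how each "Consensus with ..." column in the figure builds on the one before it.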







FIG. 8.1b
FIG. 8.2
FIG. 8.3
[plot: y-axis ticks 0.005, 0.01, 0.015, 0.02; x-axis label "category"]
FIG. 8.5a
FIG. 8.5b
FIG. 8.7
FIG. 8.8
FIG. 8.9
FIG. 8.10
[plot: x-axis "NaCl concentration" (0.9% NaCl, 1.8% NaCl); y-axis ticks 0.005, 0.01, 0.015]
FIG. 8.12
FIG. 8.13
[plot: x-axis "log(Tm 12.86 concentration)"; y-axis ticks 0 to 0.02; "0.9% NaCl baseline" marked]
[plot: x-axis "log(dilution)"; y-axis ticks 0.005, 0.01, 0.015; "0.9% NaCl baseline" marked]
[plot: x-axis "log(dilution)"]
[plot: x-axis "log(dilution)", ticks -5, -4, -3, -2]
FIG. 8.18
[plot: x-axis "log(dilution)"]
FIG. 8.20
[plot: x-axis "log(dilution)"]
FIG. 8.21
[plot: x-axis "log(dilution)"]
FIG. 8.22
[plot: x-axis "log dilution"]
FIG. 8.23
FIG. 8.25
[plot: x-axis "log(dilution)"]
FIG. 8.27
[plot: x-axis "log(dilution)"]
FIG. 8.28
FIG. 8.30
[plot: x-axis "dilution"; y-axis ticks 0.015, 0.02]
FIG. 8.32
[plot: x-axis "log(dilution)"]
FIG. 8.33
[plot: x-axis "log(dilution)"]
[plot: axis label "thermal hysteresis (°C)"]
FIG. 8.35
[plot: x-axis "time (arbitrary units)"]
FIG. 8.36
[plot: x-axis "log(time)", ticks 0, 0.5, 1, 1.5, 2, 2.5]



FIG. 8.37

"Sandwich" method of R.I. assessment

1. [diagram]

2. [diagram; labels: 10 mm diameter cover glass, liquid samples]

3. FREEZE ON -80 °C ALUMINUM PLATE (~10 MIN.)

4. PLACE ON COLD STAGE, ANNEAL AT -6 °C UP TO 12+ HOURS

FIG. 8.38

FIG. 8.39

FIG. 8.40



* 



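DNA sequence of Tm 13.17 cDNA clone

1 [nucleotide line not legible]
MKLLCCLIS

61 CCCTCATTCTGTTGGTCACAGTTCAGGCCCTGACCGAGGCACAAATTGAGAAACTGAACA
LILLVTVQA|LTEAQIEKLNK

121 AGATCAGCAAAAAATGTCAAAATGAAAGTGGAGTGTCGCAAGAGATCATAACCAAAGCTC
ISKKCQNESGVSQEIITKAR

181 GCAACGGTGACTGGGAGGACGATCCTAAACTGAAACGCCAAGTTTTTTGCGTGGCCAGGA
NGDWEDDPKLKRQVFCVARN

241 ACGCCGGTCTGGCCACGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA
AGLATESGEVVVDVLREKVR

301 GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATCATCAATAAGTGCGCCGTCAAGA
KVTDNDEETEKIINKCAVKR

361 GAGATACTGTTGAAGAGACGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA
DTVEETVFNTFKCVMKNKPK

421 AGTTCTCACCAGTTGATTGAACCACCACGACTAGTAGATGGTTCAAATGGTGTGCTTTAC
FSPVD*

481 ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACTCG
polyadenylation signal; poly(A) tail (26)
537 AGAGTATTCTAGAGCGGCCGCG [remainder illegible]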



FIG. 8.43



• 



p 

1 GGCACGAGCAAA A (AT Gj A A ACTCCTCTTGTGCTTT G tC GTTCGCCGCC 

MKLLLCFAFAA 

P ^ G 

47 AT CGT CAT pl GGA GCT CAGGCT lCT CACCGACGA A cTTSj AT AC AGA A A 
I V I G A Q A AL T D E Q I Q K 

V 

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTC C Ka G A GT GT CTc 
RNK I S K ECQQVSGV S 

V 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGATGAT
QETIDKVRTGVLVDD

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA
PKMKKHVLCFSKKTG

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VAT EAGDTNVEVLKA 



271 AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG
KLKHVASDEEVDKIV

P

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
QKCVVKKATPEETAY

P G

361 GACACCTTCAAGTGTATTTACGACAGCAAACCTGATTTCTCTCCT
DTFKCIYDSKPDFSP

Gr 

406 ATT GATTXaI TT GTTTTGT ATTTGACTGAATTTT GA CAAT A A A GGT 
I D * 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



